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Abstract 

We consider multi-type Galton Watson trees, and find the distribution of 
these trees when conditioning on very general types of recursive events. It turns 
out that the conditioned tree is again a multi-type Galton Watson tree, possibly 
with more types and with offspring distributions depending on the type of the 
father node and on the height of the father node. These distributions are given 
explicitly. We give some interesting examples for the kind of conditioning we 
can handle, showing that our methods have a wide range of applications. 


1 Introduction 

The asymptotic shape of conditioned Galton-Watson trees has been widely studied. 
For example, one could condition on the number of nodes of the tree being n, and 
letting n ^ oo. In a lot of cases, the limiting tree is quite well understood, see for 
example the survey paper by Janson [Ij. Some work on finite conditioned trees has 
been done by Geiger and Kersting [2], who studies the shape of a tree conditioned 
on having height exactly equal to n. In this context, we also mention the spinal 
construction of a Galton-Watson tree conditioned to reach generation k, as derived in 
0 . 

In this paper, we will investigate conditioning multi-type Galton-Watson trees on 
events of a recursive nature (as explained in Section[2]), one example being conditioning 
on survival to a given level. The main idea is that we consider different classes of trees, 
where the class of a tree is determined by the types and classes of her children. The 
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offspring distribution of a node depends on its type and on the level of the tree where 
this node is living. In fact, we show that the conditioned tree again is a multi-type 
Galton-Watson tree and how this can be used to directly construct such a conditioned 
tree. Our approach can be seen as a generalization of the well-known decomposition 
of a supercritical Galton-Watson tree into nodes whose offspring survives forever and 
nodes whose offspring eventually goes to extinction as discussed in |S]. 

Section 12.21 discusses a couple of examples that illustrate the applicability of our 
results. We give an example concerning mutants in a population, we discuss an 
alternative to Geiger’s construction of a tree conditioned on having height exactly k 
and we show how to condition on the size of the /cth generation. 

1.1 Notation and preliminaries 

We will consider rooted multi-type Galton-Watson trees with arbitrary offspring dis¬ 
tribution, that can depend on the current generation. In such a tree each node has a 
type, which we indicate by a natural number t G 0 := {1,..., 0}. If the root of a tree 
has type t, we use a bold-face t to denote this root. Dehne the set of trees of heigth 0 
as To = {1, 2,.. ., 6}. Then we dehne inductively for k >1 the set of trees of height 
at most k by 

OD 

Tfc = To U I_I {(t, [Ti ,..., Tn]) I Ti G Tfc-i, t G 0}, 

n=l 

and denote the set of all trees by T = U^o"^- ^ ^ ~ • • • jlk]) £ 

0 X [Tk-iY-, the trees Ti,... ,T„ will be called the children of T (notation: Tj -< T). 
The type of T will be just the type of its root and will be denoted by r{T). We now 
dehne a function iV : T —)• N® that counts how many children of each type a tree T 
has: 


iV(T) = (W(T),...,iV,(T)), where W(T) = # |t ^ T : r(T) = t} . 

The set of trees of heigth at most k having a root of type t is denoted by 

OO 

7t = {Te%-. r(T) =t} = {t} U [Ti,..., TJ) I T, G Tk-i}. 

n=l 

Let T* = Denote the ohspring distribution of a type t node at height I > 0 

by fij, for arbitrary probability measures /i* on = {0,1,...}®. Dehne indepen¬ 
dent random variables Wf ~ p*. For a vector x G we write the corresponding 
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multinomial coefficient as 


D(x) 


|x|i 

Xi ,. . . , Xq 



We now introduce the Galton-Watson probability measures on 7^*. Firstly, let Pq 
be the trivial probability measure on 7^*, so Po(t) = 1. Now dehne inductively the 
probability measure P/^ for 0 < / < fc and fc > 1 as the following probability measure 
on T^_i: ii I = k, then P^^ = Pq. Otherwise for all T G Tj^_i 


Pk{T) 


P(tv,‘ = N{T)) ^ ,f, - 

D(N{T)) J-J- 

T-<T 


where empty products are taken to be 1. The intuition is that the second sub-index 
determines the size of the hnal tree we are considering, whereas the hrst sub-index 
determines at which level we are building up the tree (so P/^ generates trees of type 
t at level I of size k — 1). We are interested in Pq^, which is the Galton-Watson prob¬ 
ability measure on 7^* (trees cut off at height k with a root of type t). 


In the next section we will introduce a class of recursive-type events on which we would 
like to condition, and discuss several examples of such events. In Section [3] we will 
introduce the conditional measures corresponding to our events, and in Section 0] we 
show that these conditional measures indeed coincide with the original Galton-Watson 
measure, conditioned on our event. 


2 Conditioning on recursive events 

In this section we introduce a class of recursive-type events on which we would like 
to condition, such as the event that the tree survives until a specihc level. 

2.1 Partitioning the set of trees 

We will now set up our general framework and show how some examples £t into it. We 
start by choosing /cq G N and partitioning Tko into m classes ^..., Typically, 
all trees in such a class have some property that all trees in the other classes do not 
have. One of the simplest examples would be a partition into two classes, where trees 
that survive until some level k are in the hrst class and all other trees in the second. 
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The partition of Tk^ will be the starting point to recursively dehne partitions of 71, 
ko < I < k into sets = 1,..., m, where k is the (maximum) height of the trees 

that we are considering. Suppose that the partition of 7I_i is already dehned. Then 
we are able to introduce a counting matrix for trees in 71- Dehne Q : 7/ —)■ such 

that for T G 71 the (i, j)th position is given by 

= ^T:f e n r^'} , 

so this is the number of children of T having type j and being an element of the ith 
partition class. Now for ko < I < k we partition into subsets Bi^i,..., Bi^m- 

This partition is the key for the recursive dehnition of A\^\ The set will contain 
exactly those trees for which the counting matrix Ci{T) is in Bi f 

Af'> = {Terr.Ci{T)eBi,}. (2.1) 


2.2 Examples 

Before going into the details of the construction of conditioned trees, we will discuss 
some examples of recursive events that can be handled by our approach. 

2.2.1 Genetic mutations 

Suppose we have a population in which sometimes an individual (mutant) is born 
having a particular mutation in its genetic material. This mutation can be inherited 
by subsequent generations. Suppose we know the probability that the root is mutated. 
Such a population can be described as a two-type Galton-Watson process in which 
the offspring distribution is type- and possibly level-dependent. We take 0 = {1,2} 
to be the set of types, where mutants have type 1. 

Suppose we would like to condition on the event “there is at least one mutant in the 

kth generation”. Choose /co = 0 and partition % = (1, 2} into the classes = {1} 

and A^q^ = {2}. For 0 < I < k, we want to dehne A[^^ and by 

^ 7 ^ ■ there is a mutant at level /} , = Ti\A[^\ 

These events satisfy a recursive relation: contains exactly those trees that have 

at least one child in For T G 71, the hrst row of the 2 x 2-counting matrix Ci{T) 
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counts the children of T that are in A 


( 1 ) 

i-i- 


Therefore, for all 0 < / < A; we let 


Bu = 


a b 
c d 


e : a + 6 > U , A ,2 = 


and now fl2.1l) gives the desired partition of Ti. For the sake of illustration, we note 
that with a minor change, we can condition on “there is at least one mutant in the 
kth generation inheriting its mutation from the root”. To achieve this, it suffices to 
merely redefine Bi i and B 12 for all 0 < / < /c as follows 




a b 
c d 


e : a > 1 '>,Bi ^2 = W^"\Bi^i 


In these two examples, we dehned one partition class ^ 71 by the event on which 
conditioning is required. The only other partition class was just the complement of 
the hrst one. Finding a suitable partition of the set of trees is not always that obvious, 
as is demonstrated in the next example. We will show how to condition on the slightly 
more complicated event “All mutants in the tree inherit their mutation from the root 
and at least one mutant is present in generation fc”. As before, define one partition 
class as the set of trees satisfying the condition. Here it is not sufficient to define 
only one other partition class. One obstacle is that some trees (namely those with a 
“spontaneous mutation”) in the complement 71 \ are forbidden as a child of trees 
in A;^\ and others are not. 

Nevertheless, with a slightly more elaborate partition, we can still handle this case. 
We distinguish four classes and partition To into 


Aq^^ = {1} , A'o' = {2} and A, 

For 0 < I < k, we define the following subsets of 

a 0 
b c 
0 0 
d 0 


1(2) 


( 3 ) 
0 

4x2. 


= Ai^) = 


-Bz,3 — 












f 

a 

b 

” 


c 

d 



e 




. 9 

h 



a > 1 


b+e+f+h>l 


Bl2 — 


Bi^i — 


r 

1- 

0 

0 

1 

0 a 


0 0 

i 

-1 

0 

0 

f 

1- 

0 

0 

1 

a b 


0 0 

i 

-1 

0 

0 


; a + c > 1 


As can be easily checked, these sets are disjoint and so this indeed is 

a partition. It follows by induction that the sets a 1*^ partition 71 in such a way that 
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• 0 < Z < fc contains exactly the trees having 

— at least one mutated child of which the mutated progeny reaches level I, 
and 

— no “spontaneous” mutants in the progeny of their children. 

• Ai ,0 < I < k contains the trees having only type 2 descendants. 

f3) 

• Ai \ 0 < l < k contains the trees having a type 2 descendant with a type 1 child 
(“spontaneous mutation”). 

• A['^\ 0 < I < k contains all other trees in 71. 

Note that these classes are dehned by properties of the children of a tree and not by 
the type of the tree itself. For example, a tree in A] ^ can have a type 1 root, but 
all its descendants have type 2. The conditional measure we are interested in is now 
obtained by conditioning Pg\, on A\}K 

2.2.2 Conditioning on the size of generation k 

As a next example, we show how to condition a single-type Galton-Watson tree on 
having exactly G individuals in the Zcth generation. In this case, we partition To = {1} 
into G -|- 2 classes by dehning 

4'^ = {1} , 4°^ = A® = A® = ... = Af ^ = Af= 0. 

Dehne x G by x := [0 1 2 ... G G -|- 1]^. For 0 < Z < Zc, we dehne 

Bi, = {ye : x^y = i] , = {v ^ > G + l} , 

where 0 < i < G. Partitioning 71 according to fl2.ip gives the following: for 0 < i < G, 
A^p contains the trees of which the Zth generation has exactly size Z, while A^^^^ 
contains the trees of which the Zth generation has at least size G -|- 1. Conditioning 
Pok on A), ^ gives the result we are looking for. 

2.2.3 The tree has heigth exactly k 

As a hnal illustration, we explain how to condition a Galton-Watson tree on having 
height exactly Zc, thus producing an alternative for the construction of Geiger and 
Kersting [2]. We consider trees in Tk+i that are conditioned to reach level k, but not 
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level k + 1. We start by choosing ko = 2, and partitioning T 2 into three sets, namely 
correct trees, short trees and long trees: 

= {T G 72 I T reaches level 1, but not level 2}, 

^ 2 ^^ = {T G 72 I T does not reach level 1} = {0}, 

= {T G 72 I T reaches level 2}. 

Dehne for each 2 < I < k + 1 

Bi^i = {n G I ni > 1, ns = 0}, 

Bi ,2 = {n G I ni = 0, ns = 0}, 

Bi,3 = {n G I ns > 1}, 


and let 71 be partitioned as in fl2.ll) . This construction guarantees that if a tree 
T G 7fc+i is an element of ^^+ 1 , then it has at least one child that reaches level k, and 
no children that reach level k + 1. If T G ^^+ 1 , ah its children do not reach level k, 
and if T G then at least one child reaches level k + 1. Conditioning on being in 

therefore gives the desired result. 

2.3 Remarks following the examples 

As it turns out from the examples in the previous section, the setup allows to condition 
on quite a variety of events. A fundamental requirement on these events is that they 
are determined only by the number of children of a tree having particular properties. 
So we can (for instance) not distinguish between trees having the same children in a 
different order. 

An additional example is discussed in detail in [1]. As an application of the theory de¬ 
veloped in the present paper, the cost of searching a tree to a given level is determined. 
The proposed model takes into account costs for having a lot of children, but also for 
walking into dead ends. So both a high expected offspring and a low expected off¬ 
spring would give high search costs. This gives rise to an optimization problem: which 
offspring distribution gives minimal costs? For this model the conditional probability 
measures are explicitly constructed, leading to recursions that enable us to calculate 
the costs and solve the optimization problem for Poisson offspring. 

Conditioning on recursive events as in the examples allows us to compute (conditional) 
probabilities that are dehned in terms of such events. As an illustration: in the 
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example on genetic mutations we can easily compute the probability that the root is 
mutated, given that there is at least one mutant in generation k. What makes the 
results even more useful is that they show how to directly construct a tree conditioned 
on some event. This means that trees conditioned on (rare) events can be studied by 
just simulating them. 


3 Conditional measures 


In this section we construct an alternative measure on T^_i, that depends on the 
event we want to condition on. As soon as we have this measure, conditioning on the 
desired event is a triviality. In the next section, we will show that in fact the two 
measures P;). and are the same. 


Dehne for 0 < / < /c — /cq by 

'pfPPk(At,nr). 

We can calculate this probability in a recursive way. Denote, for q G [0,1]™ with 
by Multi(n, q) G the multinomial distribution where we distribute n 
elements over m classes, according to the probabilities g*. We also choose independent 
random vectors Wl ~ according to the offspring distribution of a type t node at 
level I and denote the jth coordinate by hh^h. Then, for / < /c — /cq 


0=1 


pH = p I (K) (Tj- ■ ■ ■. ) e 


(m) 


(3.1) 


where, for qq, ... ,ag G N™, we dehned A := 0^=1 ^ to be the matrix for 

which Aij = aj{i). 

We proceed by dehning the conditional measure on All'Ll HT*. To do this, dehne 
for each t G {1,..., 6*} and 0</</c — /cq — Ion the same probability space as W/, 
the random matrices 


/ -.Yiy 


‘A' 


Ik 






Ik 


\ 


\ w 


(m,l) 

Ik 




(m,9) 

Ik 


such that conditional on hh/, all columns are independent and the distribution of the 
jth column satishes 


(W 


(l-t) 

Ik ) 


'X 


Ik ) 


W! ~ Multi 




i^Pl+l,ki 


j (™) N 

) Pl+l,k, 



This determines the full joint distribution of {Wf^^Xik). For a type t node at level 
I, the distribution of its children over the 6 types is given by the random vector Wf. 
Furthermore, the jth column of ^Xik represents how the type j children of this type t 
node are distributed over the m classes. For I = k — ko, we dehne for each T G nT* 




k—ko,k 


(T) = 




pt 
k—ko,k \ 


(T;! n T‘) 


as a probability measure on fl T*. Next, we inductively dehne the probability 
measures on for each 0 < I < k — ko — 1 such that for each T G 


‘(3®(r) 


F(‘A',t = Ct-,(T) I ‘X,t e Bl’t) 
D(Ci-,{T)) 


n If 


i=i 


f^T:TeA' 


U) 


where we extended the dehnition of D (see Section ll.lh to integer-valued matrices, 
and once again empty products are taken to be 1. Note that this dehnition is valid 
for all T G T^_i- we simply get = 0 whenever T ^ ^k-i- dehne 

the alternative measure P/;. on 


pikiT) = 


3.1 Construction according to the conditional measure 

We can describe the random tree T ~ P/^ as follows. The root of the tree has type t. 
To construct the tree, we hrst toss an m-sided coin to determine in which of the m 
classes T is, giving probability to the ith class If T G ^^1;, then we choose 

it according to • This means that we choose where counts the 

numbers of children of T of each type and ^Xik counts for each type the numbers of 
children that will lie in each of the m classes, according to 

The Yhij children are distributed over the ^ik,j positions uniformly at ran¬ 
dom. Then for each child of type j in we draw a tree according to k- 

In this way we have described the random tree as a Galton-Watson tree with m x 6 
‘types’ of children and type- and level-dependent offspring distribution. Note that 
conditioning on 74^1^ is trivial: we simply have to draw T according to *0;^ . 
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4 The two random trees are equally distributed 

The following theorem shows that the construction procedure of Section [3] in fact gen¬ 
erates trees with the same probabilities as under the original Galton-Watson measure. 
Fix /c and ko and dehne all measures as before. 

Theorem 1 For all 0 < I < k — ko, t ^ Q and T G T^_i, 

p,\m = PL(.T). 


Proof: The theorem is true by construction for I = k — k^. Now suppose that we have 
already shown that ^ ^ for all t. Choose T G Tk-i and suppose T G ^^1;. 

Before we show that PlkiT) = Ph^iT), we collect some useful observations. First of 
all, note that the number of ways to distribute the individuals over the positions in 
Ck-i can be written as a product by hrst assigning a type to each individual and then 
distributing all individuals of a given type over the classes (writing Ck-i for Ck-iiT) 
and N for NiT)): 

D(a-,) = c(/v)((c<«>)” ). 

i=i 

Secondly, note that G 5^2/) is equal to 


5 ; P {w; = (m,. .., n»)) P 0 Multi (n„ „ ... ,)) e B,_u 

(ni,...,n£))GN® \j=l 

= p (0Muiti (w',y epllh,...,y!rh)) e ft-t* 

\j=i 

and by fl3.1|) this is exactly Next, since Ck-i{T) determines N{T), we have: 

FCXik = Ck-i) = F{^Xik = Ck-i,W^ = N) 

= P (W/ = N)¥ {^Xik = Ck-i I Wl = N) 


p(w/=iv)np 


j=i 

e 




i=l 


'k-l 


i=l 


i=i 


p {wi = iv) jifl j n (■'pSi. 


i=l 


(i) 

,k 




(“j') 
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nWl^Nm^frfv 



C 


(i.j) 

k-l 


Combining these observations gives 


pl(t) = ‘piboJiV) 


, P(‘A'„ = C\_,(T) I ‘X„ e b£,) tT 

Plk 77FV^ 11 11 


r{f)n^ 3 ) 




D{Ck-iiT)) 

{^Xik = Ck-i{T)) 


Qr;,M^T) 


i=i 


t^t-.tgA 


U) 


D(CUTm‘X,t € 


'b!ib(r) 


P(»7 = N) 
D(N) 

P (W7 = N) 
D{N) 

P (W7 = N) 
D{N) 


f T.j~. 9 m ^ ^ r'O’*)' 

— (j) \^k-i 


nn 


t=ii=i 


nn n 

i-i tee T-<T:r6A^-’2;_inr* 


nn n 

j-i tee f’-<T:f 6A^-’2(_inT* 


/„ \ 

nn n ‘«abr) 

^i=ite0f'-<T:rGA^-’2;_inr* y 


'p,«*‘Qby(T) 


PMith TT z,PP) 


D(iV) 


n PmUP) 


T^T 


= PLiT). 


□ 


5 Example: genetic mutations revisited 

In this section we use our results to work out one of the examples of Section 12.21 For 
these calculations it will turn out to be very useful that our conditioned tree is again 
a Galton-Watson multitype tree. We consider a population with mutants and let the 
set of types be 0 = {1,2}, where type 1 denotes a mutant. The number of children 
of a type t node will have a Pois(/rt) distribution and each child has probability pt to 
be a mutant itself, independent of all other children. 
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We will condition on the event that there is at least one mutant in the kih. generation. 
The corresponding partition of 71 is given by 

= {T G 71; there is a mutant at level /} = {T G 71: Ci{T) G A,i}, 

Af' = TAaA', 

where Bi i is the set of matrices 

c a 

Remember that this means that a tree in 71 is an element of if and only if it has a 
type 1 or a type 2 child in We will now derive the recursions for the probabilities 
• A type t subtree that starts on level I < k has two types of children, and each 
child is in one of the two classes. Type and class of a child are independent of all 
other children’s properties. So we can introduce four new ‘types’, occurring according 
to the following distributions: 

~ Pois fVi+gk ■ Pt ■ ~ Pois f^p!+\,k ■ (1 - Pt) 

~ Pois (Vi+pk ■ Pt ■ /^t) , ~ Pois (^Pi+\,k ■ (1 - Pt) ■ /^t) , 

all independent of each other. In this notation stands for the number of type 

6-children in class a of a type t node at level /. Note that the intensities indeed add 

up to Hf For instance, the probability that a mutant (type 1) on level I does not 
generate a mutant on level k satishes 

and similarly it follows that 

2p(2) ^ g-/^2(T(+\,fc-P2+^p(+\,fc-(l-p2)) 

The corresponding initial conditions are 

1 (1) 2 (2) T 1 (2) 2 (1) n 

Pkk = Pkk = Pkk = Pkk = 0- 
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Figure 1: Dashed: Vofc • Solid: Dotted: survival probability of a tree with 

Poisson(3/2) offspring. 


Figure [H shows the behavior of these probabilities for the following choice of param¬ 
eters: mutants reproduce at rate /ii = 1 and non-mutants at rate /i 2 = §• Mutants 
can only generate mutants (pi = 1) and a child of a non-mutant has a very small 
probability to be a mutant, p 2 = 10“®. The dashed line shows the probability that a 
tree with mutated root has a mutant on the /cth level as a function of k. This is a 
critical tree with only mutants that eventually goes extinct. The solid line shows the 
probability that a tree with non-mutated root has a mutant on the /cth level. This tree 
is supercritical, with reproduction rate (very close to) |. In a tree with Poisson(/i) 
offspring, the extinction probability is the non-trivial solution of 


s = 


OO 


E 

n=0 


n\ 




For /i = |, this gives s ~ 0.417. The corresponding survival probability is given as a 
dotted line in Figure [T] This indicates that if the tree does not die out, then eventually 
there will be mutants almost surely, since the population grows exponentially. The 
population is of order 10® around generation 51, which explains the location 

of the increase of the solid line. 
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We will now consider a tree with a non-mutated root, conditioned on having a mutant 
on level k. We will show how to compute the expected number of mutants on each 
level in such a conditioned tree. A type t subtree at level I is in class 1 if and only if 
its counting matrix is in This corresponds to conditioning the distributions in 

dEU) on 




Write A for and Y for Then 

E[A] = E[A I X + F > 1] ■ P(A + F > 1) + E[A | A + F = 0] ■ P(A + F = 0). 
Since E[A | A + F = 0] = 0 and P(A + F > 1) = we obtain 

E[A] 


-Ik 
t 


+> 1 - 


E[A I A + F > 1] = 
And analogously: 

E[F I A + F > 1] = 


1 ( 1 ) 


P(A + F > 1) 




E[F] 


2 jl) 


PzVl,fc • (1 - Pi) ■ Pi 


^Pfk 


P(A + F > 1) 

The conditioned tree is in fact a four-type Galton-Watson tree: 


type a : 

mutant, class 1 

type /3 : 

mutant, class 2 

type 7 : 

non — mutant, class 1 

type 5 : 

non — mutant, class 2 


Abbreviating ^ by and (1 — pj) by g*, the expected offspring of a node at level 
I < k described by the following matrix: 


Mik = 


/ V -pi -/il/Vzfc 
■ Pi ■ Pi 

^P^ ■ Qi ■ Pi/Vzfc 

V ^P^ ■ gi ■ Pi 


0 

V • Pi■ Pi 

0 

2p2 ■ gi ■ Pi 


■ P2 ■ P2/^p\k 0 \ 

• P2 ■ P2 • P2 ■ P2 

2pi ■ g2 • P2/Mfc 0 

2p2 ■ g2 ■ P2 ^P^ ■ g2 ■ P2 / 


In this matrix, the columns give the expected offspring of a type a, /?, 7 or 5 node 
respectively at level I in the tree. Now the conditioned tree we are interested in is just 
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Figure 2: Expected number of mutants in each generation of a tree conditioned to 
have at least one mutant in generation 60. Solid: the root is not a mutant. Dashed: 
the root is a mutant. 


a tree with a root of type 7 . The types a and {3 correspond to mutated individuals. 
Therefore, the expected number of mutants on level I in the conditioned tree is given 
by 

/ 0 \ 

0 


(110 0) . . . Mo^fc 


1 

VO/ 


See Figure [21 for a plot of these expected numbers as a function of the generation. 


As a last example, we computed the expected total number of individuals in a tree with 
a non-mutant root and conditioned to have no mutant in generation k. If k is small, 
occurrence of mutants is unlikely anyway, so then the tree just grows exponentially. 
If the population grows beyond order 10®, then the condition has a serious influence 
on the expected size of the tree. For example, taking fc = 60, the population size in 
the unconditioned tree would be of order 10 ^®, but in the conditioned tree it is only of 
order 10®. See Figure |3l left plot. For even larger k the condition to have no mutant 
in generation k is very restrictive. See Figure [31 right plot. Apparently, the condition 
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Figure 3: Expected size of each generation in a tree conditioned to have no mutant 
in generation k. Left: k = 60, right: k = 90. In both cases, the root is not a mutant. 
Note the huge difference in expected population size. 


more or less forces the tree to die out early. In the unlikely case that it survives to 
generation 90, the population stays small for a long time. The minimal expected size 
is 10“^ individuals and is attained around generation 40. After that the population 
starts expanding. The later an individual is born, the less its progeny is influenced 
by the condition that no mutant is present in generation 90. This also explains the 
increase in the curve that is seen at the end. 


6 Conclusion 

We have demonstrated how to condition multi-type Galton-Watson trees on events 
having some recursive nature. More specihcally, we looked at partitions of the set of 
trees in which each partition set is dehned by some tree property. A crucial aspect 
of these properties is that they are determined completely by the types of children of 
the tree and the partition sets to which they belong. As our examples show, there is 
a wide variety of events htting into this framework. 

We have shown that such a conditioned tree itself is again a multi-type Galton-Watson 
tree, and we derived equations for the type- and level-dependent offspring distribution. 
These results turn out to be very useful to analyze conditioned trees. Also, using our 
explicit construction procedure we can directly generate a tree that is conditioned to 
satisfy some property that has very low probability, which should also be useful for 
simulation purposes. 


16 







References 


[1] E.A. Cator, H. Don - Constructing and searching conditioned Galton-Watson 
trees. arXiv:1412.5890. 

[2] J. Geiger, G. Kersting - The Galton-Watson tree conditioned on its height. In: 
Probability Theory and Mathematical Statistics (Vilnius, 1998). (TEV, Vilnius, 
1999), 277-286. 

[3] J. Geiger - Elementary new proofs of elassical limit theorems for Galton-Watson 
processes. J. Appl. Probab. 36 (1999), 301-309. 

[4] S. Janson - Simply generated trees, conditioned Galton-Watson trees, random al¬ 
locations and condensation. Probability Surveys, Vol. 9 (2012), 103-252. 

[5] Lyons, R. with Peres, Y. ~ Probability on Trees and Networks. Gambridge Uni¬ 
versity Press. In preparation (2015). Gurrent version available at 

http://pages. in .edu/~rdlyons/, 


17 


